A DTW-based DAG technique for speech and speaker feature analysis
نویسنده
چکیده
A DTW-based directed acyclic graph (DAG) optimization method is proposed to exploit the interaction information of speech and speaker in feature component. We introduce the DAG representation of intra-class samples based on dynamic time warping (DTW) measure and propose two criteria based on in-degree of DAG. Combined with (l − r) optimization algorithm, the DTW-based DAG model is applied to discuss the feature subset information of representing speech and speaker in text-dependent speaker identification and speaker-dependent speech recognition. The experimental results demonstrate the powerful ability of our model to reveal the low dimensional performance and the influence of speech and speaker information in different tasks,and the corresponding DTW recognition rates are also calculated for comparison.
منابع مشابه
Comparing DTW-Based and HMM-Based Text- Dependent Speaker Verification Algorithms
Speaker verification is among the widely used biometrics which usually offer more secure authentication for user access than regular passwords. In this final project, we study the DTW-based and HMM-based speaker verification algorithms and a comparison between them is made based on their performances on our recorded dataset. The two feature sets commonly used in Speech Recognition Systems, LPC ...
متن کاملEffect of Dynamic time Warping Based Alignment on the Accuracy of the Transformation Function for Voice Conversion
Absract--Voice conversion involves transformation of speaker characteristics in a speech uttered by a speaker called source speaker to generate a speech having voice characteristics of a desired speaker called the target speaker. Voice conversion is used in many applications namely dubbing, to enhance the quality of the speech, text-to-speech synthesizers, online games, multimedia, music, cross...
متن کاملFast Speaker Recognition using Efficient Feature Extraction Technique
Digital processing of speech signal and speaker recognition algorithm is very important for fast and accurate automatic voice recognition technology. A direct analysis of the voice signal is complex due to too much information contained in the signal. Therefore the digital signal processes such as Feature Extraction and Feature Matching are introduced to represent the voice signal. The non-para...
متن کاملLinear and non-linear fusion of ALISP-based and GMM systems for text-independent speaker verification
Current state-of-the-art speaker verification algorithms use Gaussian Mixture Models (GMM) to estimate the probability density function of the acoustic feature vectors. They are denoted here as global systems. In order to give better performance, they have to be combined with other classifiers, using different fusion methods. The performance of the final classifier depend on the choice of the s...
متن کاملSpeaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
متن کامل